Confirming protein-protein interactions by text mining
Abstract
Motivation: Although manual curation of protein-protein interactions from the literature has resulted in several large databases, many interactions are still available only in manuscripts. Though PubMed does include a search engine, protein-protein interactions remain difficult to find in an automated manner.
Results: OPHID Text Miner (OTM) is an information extraction system dedicated to finding specific protein-protein interactions in PubMed abstracts. Originally designed to validate predicted interactions, it can be used to provide additional support for researchers. Using several layers of pattern matching, OTM can extract proof for interactions between two proteins with 47% recall and 93% precision.
Availability: OTM’s results have been integrated into OPHID (Online Predicted Human Interaction Database; http://ophid.utoronto.ca). Additional information regarding interaction terms and synonym databases is available upon request.
Contact: [email protected]
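The abstract does not describe OTM's patterns in detail, so the following is only a minimal sketch of what one layer of pattern matching for a given protein pair might look like, together with the precision and recall measures quoted above. The term list, the 40-character window, and the function names are all assumptions made for illustration; they are not taken from OTM.

```python
import re

# Hypothetical interaction terms; OTM's actual term list is not given in the abstract.
INTERACTION_TERMS = ["interacts with", "binds to", "binds", "associates with"]


def find_interaction(sentence, protein_a, protein_b, terms=INTERACTION_TERMS):
    """Return the interaction term linking the two proteins in the sentence, or None.

    Looks for: protein, short gap, interaction term, short gap, other protein,
    trying both orders of the pair. The 40-character gap is an arbitrary choice.
    """
    term_alt = "|".join(re.escape(t) for t in terms)
    for first, second in ((protein_a, protein_b), (protein_b, protein_a)):
        pattern = re.compile(
            rf"\b{re.escape(first)}\b.{{0,40}}?\b({term_alt})\b.{{0,40}}?\b{re.escape(second)}\b",
            re.IGNORECASE,
        )
        match = pattern.search(sentence)
        if match:
            return match.group(1).lower()
    return None


def precision_recall(predicted, gold):
    """Compute precision and recall for a set of predicted items against a gold set."""
    true_positives = len(predicted & gold)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    return precision, recall
```

A system tuned this way tends to favour precision over recall, which is consistent with the 93% precision and 47% recall reported: strict patterns rarely fire on unrelated sentences but miss interactions phrased in ways the patterns do not cover.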
Similar resources
Collection-Wide Extraction of Protein-Protein Interactions
Evidence in support of relationships among biomedical entities, such as protein-protein interactions, can be gathered from a multiplicity of sources. The larger the pool of evidence, the more likely a given interaction can be considered to be. In the context of biomedical text mining, this elementary observation can be translated into an approach that seeks to find in the literature all availab...
Using Biomedical Literature Mining To Consolidate The Set Of Known Human Protein-Protein Interactions
This paper presents the results of a large-scale effort to construct a comprehensive database of known human protein interactions by combining and linking known interactions from existing databases and then adding to them by automatically mining additional interactions from 750,000 Medline abstracts. The end result is a network of 31,609 interactions amongst 7,748 proteins. The text mining syste...
Biological Text Mining for Extraction of Proteins and Their Interactions
Text mining techniques have been proposed for extracting protein names and their interactions. First, we have made improvements on existing methods for handling single word protein names consisting of characters, special symbols, and numbers. Second, compound word protein names are extracted using conditional probabilities of the occurrences of neighboring words. Third, interactions are extract...
Negatome 2.0: a database of non-interacting proteins derived by literature mining, manual annotation and protein structure analysis
Knowledge about non-interacting proteins (NIPs) is important for training the algorithms to predict protein-protein interactions (PPIs) and for assessing the false positive rates of PPI detection efforts. We present the second version of Negatome, a database of proteins and protein domains that are unlikely to engage in physical interactions (available online at http://mips.helmholtz-muenchen.d...
Protein Interactions Extracted from Genomes and Papers
To assess the feasibility of extracting protein interactions from text, we have recently organized the BIOCREATIVE II challenge (http://biocreative.sourceforge.net) in collaboration with the MINT and INTACT databases. The competition was divided into four sub-tasks: a) ranking of publications by their relevance on experimental determination of protein interactions, b) detection of protein interacti...